Improving Chi-Square Feature Selection using a Bernoulli Model for Multi-label Classification of Indonesian-Translated Hadith

نویسندگان

چکیده

Hadith is the foundational knowledge in Islam that must be studied and practiced by Muslims. In Hadith, several types of teachings are beneficial to Muslims all mankind. Some serve as advice, while others contain prohibitions should adhere to. There yet do not belong these categories only information. This study focuses on increasing performance Chi-Square feature selection obtain relevant features for multilabel classification Indonesian-translated Bukhari data. proposes a Chi-Square-based Bernoulli model improve which appropriate short-text data such Hadith. The findings this show proposed method can select based classes; thereby improving with an error value 9.38% compared (9.91%) obtained using basic selection.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Feature Selection for Improving Multi-Label Classification using MEKA

The extensive dimensionality in multi-label classification can be overcome by selecting representative words that describe an instance and removing the redundant and insignificant ones. The popular technique of feature selection when applied reduces the size of the dataset and hence speeds up and improves the accuracy of the learning process of classification. This paper looks at the performanc...

متن کامل

Feature Selection for Multi-label Classification Problems

This paper proposes the use of mutual information for feature selection in multi-label classification, a surprisingly almost not studied problem. A pruned problem transformation method is first applied, transforming the multi-label problem into a single-label one. A greedy feature selection procedure based on multidimensional mutual information is then conducted. Results on three databases clea...

متن کامل

Conditional Bernoulli Mixtures for Multi-label Classification

Multi-label classification is an important machine learning task wherein one assigns a subset of candidate labels to an object. In this paper, we propose a new multi-label classification method based on Conditional Bernoulli Mixtures. Our proposed method has several attractive properties: it captures label dependencies; it reduces the multi-label problem to several standard binary and multi-cla...

متن کامل

MLIFT: Enhancing Multi-label Classifier with Ensemble Feature Selection

Multi-label classification has gained significant attention during recent years, due to the increasing number of modern applications associated with multi-label data. Despite its short life, different approaches have been presented to solve the task of multi-label classification. LIFT is a multi-label classifier which utilizes a new strategy to multi-label learning by leveraging label-specific ...

متن کامل

Efficient Multi-Label Feature Selection Using Entropy-Based Label Selection

Abstract: Multi-label feature selection is designed to select a subset of features according to their importance to multiple labels. This task can be achieved by ranking the dependencies of features and selecting the features with the highest rankings. In a multi-label feature selection problem, the algorithm may be faced with a dataset containing a large number of labels. Because the computati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Advanced Computer Science and Applications

سال: 2021

ISSN: ['2158-107X', '2156-5570']

DOI: https://doi.org/10.14569/ijacsa.2021.0121268